Enhancing neural-network performance via assortativity 
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The performance of attractor neural networks has been shown to depend crucially on the het- 
erogeneity of the underlying topology. We take this analysis a step further by examining the effect 
of degree-degree correlations - or assortativity - on neural-network behavior. We make use of a 
method recently put forward for studying correlated networks and dynamics thereon, both ana- 
lytically and computationally, which is independent of how the topology may have evolved. We 
show how the robustness to noise is greatly enhanced in assortative (positively correlated) neural 
networks, especially if it is the hub neurons that store the information. 

PACS numbers: 64.60. aq, 84.35. +i, 89.75.Fb, 87.85.dm 



I. BACKGROUND 



For a dozen years or so now, the study of complex 
systems has been heavily influenced by results from net- 
work science - which one might regard as the fusion of 
graph theory with statistical physics [3, Q ■ Phenomena 
as diverse as epidemics 3, cellular function Q, power- 
grid failures [f| or internet routing Q , among many oth- 
ers 0, depend crucially on the structure of the under- 
lying network of interactions. One of the earliest sys- 
tems to have been described as a network was the brain, 
which is made up of a great many neurons connected to 
each other by synapses Mathematically, the first 

neural networks combined the Ising model [12| with the 
Hebb learning rule [l3T ] to reproduce, very successfully, 
the storage and retrieval of information Neu- 
rons were simplified to binary variables (like Ising spins) 
representing firing or non- firing cells. By considering the 
trivial fully-connected topology, exact solutions could be 
reached, which at the time seemed more important than 
attempting to introduce biological realism. Subsequent 
work has tended to focus on considering richer dynamics 
for the individual cells rather than on the way in which 
these are interconnected [III E3, EH • However, the topol- 
ogy of the brain - whether at the level of neurons and 
synapses, cortical areas or functional connections - is ob- 
viously far from trivial [l9l - l24l |. 



The number of neighbors a given node in a network 
has is called its degree, and much attention is paid to 
degree distributions since they tend to be highly hetero- 
geneous for most real networks. In fact, they are often 
approximately scale-free (i.e., described by power laws) 
Pi B HE El- By including this topolo gical feature in 
a Hopfield-like neural-network model, Torres et al. [27| 
found that degree heterogeneity increases the system's 
performance at high levels of noise, since the hubs (high 
degree nodes) are able to retain information at levels well 
above the usual critical noise. To prove this analytically, 
the authors considered the configurational ensemble of 
networks (the set of random networks with a given de- 
gree distribution but no degree-degree correlations) and 



showed that Monte Carlo (MC) simulations were in good 
agreement with mean-field analysis, despite the approx- 
imation inherent to the latter technique when the net- 
work is not fully connected. A similar approach can also 
be used to show how heterogeneity may be advantageous 
for the performance of certain tasks in models with a 
richer dynamics [28( . It is worth mentioning that this in- 
fluence of the degree distribution on dynamical behavior 
is found in many other settings, such as the more general 
situation of systems of coupled oscillators [2j| . 

Another property of empirical networks that is quite 
ubiquitous is the existence of correlations between the de- 
grees of nodes and those of their neighbors [3(| HH . If the 
average degree-degree correlation is positive the network 
is said to be assortative, while it is called dis assortative if 
negatively correlated. Most heterogeneous networks are 
disassortative [l|, which seems to be because this is in 
some sense their equilibrium (maximum entropy) state 
give n the constraints imposed by the degree distribution 
[32|. However, there are probably often mechanisms at 
work which drive systems from equilibrium by inducing 
different correlations, as appears to be the case for most 
social networks, in which nodes (people) of a kind tend 
to group together. This feature, known as assortativity 
or mixing by degree, is also relevant for processes taking 
place on networks. For instance, assortative networks 
have lower percolation thresholds and are more robust to 
targeted attack [3l[, while disassortative ones make for 
more stable ecosystems and are - at least according to 
the usual definition - more synchronizable [331 ] . 

The approach usually taken when studying correlated 
networks computationally is to generate a network from 
the configuration ensemble and then introduce correla- 
tions (positive or negative) by some stochastic rewiring 
process [HI . A drawback of this method, however, is 
that results may well then depend on the details of this 
mechanism: there is no guarantee that one is correctly 
sampling the phase space of networks with given correla- 
tions. For analytical work, some kind of hidden variables 
from which the correlations originate are often consid- 
ered [35143H ] - an assumption which can also be used to 
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generate correlated networks computationally 37]. This 
can be a very powerful method for solving specific net- 
work models. However, it may not be appropriate if 
one wishes to consider all possible networks with given 
degree-degree correlations, independently of how these 
may have arisen. Here we get round this problem by 
ma king use of a method recently suggested by Johnson et 
al. [32j | whereby the ensemble of all networks with given 
correlations can be considered theoretically without re- 
curring to hidden variables. Furthermore, we show how 
this approach can be used computationally to generate 
random networks that are representative of the ensem- 
ble of interest (i.e., they are model- independent). In this 
way, we study the effect of correlations on a simple neu- 
ral network model and find that assortativity increases 
performance in the face of noise - particularly if it is the 
hubs that are mainly responsible for storing information 
(and it is worth mentioning that there is experimental 
evidence suggestive of a main functional role played by 
hub neurons in the brain [3f| Ho|). The good agreement 
between the mean-field analysis and our MC simulations 
bears witness both to the robustness of the results as re- 
gards neural systems, and to the viability of using this 
method for studying dynamics on correlated networks. 



shall consider the weights to be of the form = ujij , 
where the element &ij of the adjacency matrix repre- 
sents the number of directed edges (usually interpreted 
as synapses in a neural network) from node j to node i, 
while Cj stores the patterns, as before: 



For the sake of coherence with previous work, we shall 
assume a to be symmetric (i.e., the network is undi- 
rected), so each node is characterized by a single degree 
ki = ctij. However, all results are easily extended to 
directed networks - in which nodes have both an in de- 
gree, k\ n — an d an ou t degree, fcf ut = J2j ®ji 
- by bearing in mind it is only a neuron's pre-synaptic 
neighbors that influence its behavior. The mean degree 
of the network is (k), where the angles stand for an av- 
erage over nodes: (•) = TV -1 [Si- 



Network ensembles 



II. PRELIMINARY CONSIDERATIONS 
A. Model neurons on networks 

The attractor neural network model put forward by 
Hopfield [HI consists of TV binary neurons, each with 
an activity given by the dynamic variable Sj = ±1. 
Every time step (MCS), each neuron is updated ac- 
cording to the stochastic transition probability P(si — ¥ 
±1) = | [1 ± tanh (hi/T)] (parallel dynamics), where the 
field hi is the combined effect on i of all its neighbors, 
hi = ibijSj, and T is a noise parameter we shall call 
temperature, but which represents any kind of random 
fluctuations in the environment. This is the same as 
the Ising model for magnetic systems, and the transi- 
tion rule can be derived from a simple interaction energy 
such that aligned variables s (spins) contribute less en- 
ergy than if they were to take opposite values. However, 
this system can store P given configurations (memory 
patterns) £f = ±1 by having the interaction stren gth s 
(synaptic weights) set according to the Hebb rule }13| : 

Wij oc Y?,v=i£i£j ■ I n this way, each pattern becomes 
an attractor of the dynamics, and the system will evolve 
towards whichever one is closest to the initial state it is 
placed in. This mechanism is called associative memory, 
and is nowadays used routinely for tasks such as image 
identification. What is more, it has been established that 
something similar to the Hebb rule is implemented in 
nature via the processes of long-term potentiation and 
depression at the synapses 1411 . and this phenomenon is 
indeed required for learning |42J. 

To take into account the topology of the network, we 



When one wishes to consider a set of networks which 
are randomly wired while respecting certain constraints 
- that is, an ensemble - it is usually useful to define the 
expected value of the adjacency matrix, E(a) = i [iij . 
The element of this matrix is the mean value of 
obtained by averaging over the ensemble. For instance, 
in the Erdos-Renyi (ER) ensemble all elements (outside 
the diagonal) take the value ef R = (k)/N, which is the 
probability that a given pair of nodes be connected by 
an edge. For studying networks with a given degree se- 
quence, (ki, ...fcjv), it is common to assume the configu- 
ration ensemble, defined as 

hh ■ 

con J rvi^j 

^ ~ Jk)N 

This expression can usually be applied also when the con- 
straint is a given degree distribution, p(k), by integrating 
over p(ki) and p(kj) where appropriate. One way of de- 
riving i con f is to assume one has ki dangling half-edges 
at each node i; we then randomly choose pairs of half- 
edges and join them together until the network is wired 
up. Each time we do this, the probability that we join i 
to j is kikj j ((k)N) 2 , and we must perform the operation 
(k)N times. Bianconi showed that this is also the solu- 
tion for Barabasi- Albert evolved networks [46|. However, 
we should bear in mind that this result is only strictly 
valid for networks constructed in certain particular ways, 
such as in these examples. It is often implicitly assumed 
that were we to average over all random networks with 
a given degree distribution, the mean adjacency matrix 
obtained would be i con f . As we shall see, however, this 
is not necessarily the case [32j . 
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FIG. 1: Mean-nearest-neighbor functions k nn (k) for scale- 
free networks with /3 = —0.5 (disassortative), 0.0 (neutral), 
and 0.5 assortative, generated according to the algorithm de- 
scribed in Sec. IIII Bl Inset: degree distribution (the same in 
all three cases). Other parameters are 7 = 2.5, (k) = 12.5, 
TV = 10 4 . 



Correlated networks 



In the configuration ensemble, the expected value of 
the mean degree of the neighbors of a given node is 
k 



1 J2j^ij U ^j — (k 2 )/{k), which is indepen- 
However, as mentioned above, real networks 



dent of ki 

often display degree-degree correlations, with the result 
that fc nTSl i = k nn (ki). If k nn (k) increases with k, the 
network is said to be assortative - whereas it is disassor- 
tative if it decreases with k (see Fig. Q}. This is from the 
more general nomenclature (borrowed form sociology) in 
which sets are assortative if elements of a kind group 
together, or assort. In the case of degree-degree corre- 
lated networks, positive assortativity means that edges 
are more than randomly likely to occur between nodes of 
a similar degree. A popular measure of this phenomenon 
is Pearson's coefficient applied to the edges [l|, HI, [3l| : 
r = {[hk[] - [h] 2 )/([kf] - [ki] 2 ), where h and k[ are the 
degrees of each of the two nodes belonging to edge I, and 
[•] = ((fyN)^ 1 Ylii') i s an average over edges. 

The ensemble of all networks with a given degree se- 
quence (fci, ...fejv) contains a subset for all members of 
which k nn (k) is constant (the configuration ensemble), 
but also subsets displaying other functions k nn (k). We 
can identify each one of these subsets (regions of phase 
space) with an expected adjacency matrix e which simul- 
taneously satisfies the following conditions: i) Ylj kjtij = 
kik nn {ki), Mi (by definition of k nn (k)), and ii) £\ = 
ki, Vz (for consistency). An ansatz which fulfills these 
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where and the function f(v) is in general arbitrary 

32]. (If the network were directed, then ki — k\ n and 
kj = fc° u ^ in this expression.) This ansatz yields 



knn(k) 



(k 2 



(k) 



+ / dvj(y)a h 



{k" 



(2) 



(the first term being the result for the configuration 
ensemble), where <Jb+i = (k b+1 ) — (k)(k b ). To prove 
the uniqueness of a matrix e obtained in this way (i.e., 
that it is the only one compatible with a given k nn {k)) 
assume that there exists another valid matrix e' =/= e. 



Writing 



h{ki,kj) = hij, then Condition 



i) implies that • kjhij = 0, Vi, while Condition ii) 
means that J2j hij = 0, Vi. It follows that hij = 0, 
Vi,j. This means that e is not just one possible way 
of obtaining correlations according to k nn (k); rather, 
there is a two-way mapping between e and k nn (k): every 
network with this particular function k nn (k) and no 
other ones are contained in the ensemble defined by 
e. Thanks to this, if we are able to consider random 
networks drawn according to this matrix (whether we 
do this analytically or computationally; see Section 
IIII Bp . we can be confident that we are correctly taking 
account of the whole ensemble of interest. In other 
words, whatever the reasons behind the existence of 
degree-degree correlations in a given network, we can 
study the effects of these with only information on p(k) 
and k nn (k) by obtaining the associated matrix e. This 
is not to say, of course, that all topological properties 
are captured in this way: a particular network may 
have other features - such as higher order correlations, 
modularity, etc. - the consideration of which would 
require concentrating on a sub-partition of those with 
the same p(k) and k nn {k). But this is not our purpose 
here. 



In many empirical networks, k nn (k) has the form 
k nn (k) = A+ Bk?, with A,B > [1 113 - the mix- 
ing being assortative if /3 is positive, and disassortative 
when negative. Such a case is fitted by Eq. @ if 



m = c 



(T2 



C/3+2 



S(f - p - 1) - &{y - 1) 



with C a positive constant, since this choice yields 



- (k 2 ) 
k nn (k) = ^ + Ca 2 



k@ 



I 



(3) 



(4) 



Johnson et al. [32[ obtained the entropy of ensembles 
of networks with scale-free degree distributions (p(k) ~ 
k~ 7 ) and correlations given by Eq. and found that 



4 



the most likely configurations (those maximizing the en- 
tropy) generally correspond to correlated networks. In 
particular, the expected mixing, all other things being 
equal, is usually a certain degree of disassortativity - 
which explains the predominance of these networks in the 
real world. They also showed that the maximum entropy 
is usually obtained for values of C close to one. Here, 
we shall use this result to justify concentrating on corre- 
lated networks with C — 1, so that the only parameter 
we need to take into account is /3. It is worth mentioning 
that Pastor-Satorras et al. originally suggested using this 
exponent as a way of quantifying correlations [30|, since 
this seems to be the most relevant magnitude. Because 
(3 does not depend directly on p(k) (as r does), and can 
be defined for networks of any size (whereas r, in very 
heterogeneous networks, always goes to zero for large N 
due to its normalization (47j). we shall henceforth use (3 
as our assortativity parameter. 

So, after plugging Eq. © into Eq. ([T]), we find that 
the ensemble of networks exhibiting correlations given by 
Eq. (HI (and C = 1) is defined by the mean adjacency 
matrix 



1 

" N 

<7 2 



[hi 



1 



kj - (k)] 



a/3+2 N 



(fcjfc 



\P+1 

11 + 1 _ 1./3+1 

+1\ ' 3 



(kP + 1 ) 



(5) 



III. ANALYSIS AND RESULTS 

A. Mean field 

Let us consider the single-pattern case (P = 1, £, = 
Q). Substituting the adjacency matrix a for its expected 
value e (as given by Eq. ([5])) in the expression for the local 
field at i - which amounts to a mean-field approximation 
- we have 



MO 



C/3+2 



where we have defined 



(k a ) 



for a — 0, 1, j3 + 1. These order parameters measure 
the extent to which the system is able to recall informa- 
tion in spite of noise [28j . For the first order we have 
fj-o = m = (fiSi), the standard overlap measure in neu- 
ral networks (analogous to magnetization in magnetic 
systems), which takes account of memory performance. 
However, /xi, for instance, weighs the sum with the degree 
of each node, with the result that it measures information 
per synapse instead of per neuron. Although the overlap 
m is often assumed to represent, in some sense, the mean 



firing rate of neurological experiments, it is possible that 
/ii is more closely related to the empirical measure, since 
the total electric potential in an area of tissue is likely to 
depend on the number of synapses transmitting action 
potentials. In any case, a comparison between the two 
order parameters is a good way of assessing to what ex- 
tent the performance of neurons depends on their degree 
- larger-degree model neurons can in general store infor- 
mation at higher temperatures than ones with smaller 
degree can |27l |. 

Substituting Si for its expected value according to the 
transition probability, Si — > tanh(/ii/T), we have, for any 
a, 



or, equivalently, the following 3-D map of closed coupled 
equations for the macroscopic overlap observables [Xo, \x\ 
and fJ./3+i - which describes, in this mean-field approxi- 
mation, the dynamics of the system: 



Mo(* + l) 
M/3+i(*+l) 



p(k)tanh[F(t)/((k)T)]dk 



i / p(fc)fctanh[Fm/((fc)T)]dfc (6) 



p(k)k 0+1 tanh[F(i) /((fc)T)]dfc, 



with 



F(t) 



-^[^+i( M/3+ i(*)-M*)) 
(fc^X/xoW-zW*))]- 



This can be easily computed for any degree distribution 
p(k). Note that taking (3 = (the uncorrelated case) 
the system collapses to the 2-D map obtained in Ref. 
[27j, while it becomes the typical 1-D case for a homo- 
geneous p(k) - say a fully-connected network (l5| . It is 
in principle possible to do similar mean-field analysis for 
any number P of patterns, but the map would then be 
3P-dimensional, making the problem substantially more 
complex. 

At a critical temperature T c , the system will undergo 
the characteristic second order phase transition from a 
phase in which it exhibits memory (akin to ferromag- 
netism) to one in which it does not (paramagnetism). To 
obtain this critical temperature, we can expand the hy- 
perbolic tangent in Eqs. © around the trivial solution 
((1q, (ii, fip+i) ~ (0,0,0) and, keeping only linear terms, 
write 
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Mi 



(fc) 2 T c 



[(fc)Vl + cr 2 |U/J + i] > 



T c {k){k^ 



f7 /3+2 



/*0 



CT/3+2 



(fc ,+ l )2 _ (fc 2(, + l)^ 



Defining 
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B 
D 



0~2 

<fc) 2 ' 

q 2 (fc 2 ^ +1 >) - (fc^ +1 ) 2 

(k){kf ^ + 1 ) , 



T c will be the solution to the third order polynomial equa- 
tion: 

T c 3 - (B + 1)T 2 + (B - A)T C + A(£ - D) = 0. (7) 

Note that for neutral (i.e., uncorrelated) networks, /3 = 0, 
and so A = B = D. We then have T c = (fc 2 )/(fc) 2 , as 
expected [28j . 



B. Generating correlated networks 

Given a degree distribution p(fc), the ensemble of net- 
works compatible with this constraint and with degree- 
degree correlations according to Eq. ([4]) (with some expo- 
nent (3) is defined by the mean adjacency matrix e of Eq. 
([5]) - as described in Section Pl Cl and in Ref. [32[ . There- 
fore, although there will generally be an enormous num- 
ber of possible networks in this volume of phase space, 
we can sample them correctly simply by generating them 
according to e. To do this, first we have to assign to each 
node a degree drawn from p(k). If the elements of e were 
probabilities, it would suffice then to connect each pair of 
nodes with probability i^ to generate a valid net- 
work. Strictly speaking, e is an expected value, which 
in certain cases can be greater than one. To get round 
this, we write a probability matrix p = e/a with a some 
value such that all elements of p are smaller than one. 
If we then take random pairs of nodes and, with 

probability pij, place an edge between them, repeating 
the operation until \(k)N edges have been placed, the 
expected value of edges joining i and j will be iij . This 
method is like the hidden variable technique (37j in that 
edges are placed with a predefined probability (which is 
why the resulting ensemble is canonical). The difference 
lies in the fact that in the method here described corre- 
lations only depend on the degrees of nodes. 



We are interested here in neural networks, in which a 
given pair of nodes can be joined by several synapses, 
so we shall not impose the restriction of so-called simple 
networks of allowing only one edge at most per pair. We 
shall, however, consider networks with a structural cutoff: 
ki < \J (k)N, Vi [48| . This ensures that, at least for 
/3 < 0, all elements of i are indeed smaller than one. 

Because we can expect effects due to degree-degree 
correlations to be largest when p(k) is very broad, and 
since most networks in nature and technology seem 
to exhibit app roximately power-law degree distributions 
[3, 0, HE HU) we shall here test our general theoret- 
ical results against simulations of scale-free networks: 
p(k) ~ k 1 . This means that a network (or the region of 
phase space to which it belongs) is characterized by the 
set of parameters {{k),N, 7, /?}. 



C. Assortativity and dynamics 

In Fig. [5]we plot the stationary value of n\ against the 
temperature T, as obtained from simulations and Eqs. 
©, for disassortative, neutral and assortative networks. 
The three curves are similar at low temperatures, but as 
T increases their behavior becomes quite different. The 
disassortative network is the least robust to noise. How- 
ever, the assortative one is capable of retaining some in- 
formation at temperatures considerably higher than the 
critical value, T c — (k 2 )/(k), of neutral networks. A com- 
parison between /ii and no (see Fig. [3]) shows that it is 
the high degree nodes that are mainly responsible for this 
difference in performance. This can be seen more clearly 
in Fig. [31 which displays the difference [i\ — [Iq against 
T for the same networks. It seems that, because in an 
assortative network a sub-graph of hubs will have more 
edges than in a disassortative one, it has a higher effective 
critical temperature. Therefore, even when most of the 
nodes are acting randomly, the set of nodes of sufficiently 
high degree nevertheless displays associative memory. 

The phase diagram if Fig. [5] shows the critical temper- 
ature, T c , as obtained from Eq. ([7]). In addition to the 
effect reported in Ref. [27j whereby the T c of scale-free 
networks grows with degree heterogeneity (decreasing 7), 
it also increases very significantly with positive degree- 
degree correlations (increasing /3). 

At large values of AT, the critical temperature scales 
as T c ~ N b , with b > a constant. However, because 
the moments of k appearing in the coefficients of Eq. (J7|) 
can have different asymptotic behavior depending on the 
values of 7 and /?, the scaling exponent b differs from 
one region to another in the space of these parameters. 
These are the seven regions shown in Fig. [SI along with 
the scaling behavior exhibited by each one. This can be 
seen explicitely in Fig. where T Cl as obtained from 
MC simulations, is plotted against N for cases in each 
of the regions with 7 < 3. In each case, the scaling is 
as given by Eq. (J7J and shown in Fig. [5] For the four 
regions with 7 < 3, from lowest to highest assortativity 
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FIG. 2: Stable stationary value of the weighted overlap /ii 
against temperature T for scale-free networks with correla- 
tions according to k nn ~ fc' 3 , for /? = —0.5 (disassortative), 
0.0 (neutral), and 0.5 (assortative). Symbols from MC simu- 
lations, with errorbars representing standard deviations, and 
lines from Eqs. (|6]). Other network parameters as in Fig. [T] 
Inset: fii against T for the assortative case (/3 = 0.5) and 
different system sizes: N = 10 4 , 3 ■ 10 4 and 5 ■ 10 4 . 



FIG. 4: Difference between the stationary values and 
[la for networks with f3 = —0.5 (disassortative), 0.0 (neutral) 
and 0.5 (assortative), against temperature. Symbols from MC 
simulations, with errorbars representing standard deviations, 
and lines from Eqs. ([6]). Line shows the expected level of 
fluctuations due to noise, ~ N~^. Other parameters as in 
Fig. [I] 




result of assortativity. 



FIG. 3: Stable stationary values of order parameters /io, Mi 
and Ats+i against temperature T, for assortative networks ac- 
cording to P = 0.5. Symbols from MC simulations, with er- 
rorbars representing standard deviations, and lines from Eqs. 
([6]). Other parameters as in Fig. [T] 



we have scaling exponents which are dependent on: only 
7 (region I), only /3 (region II), both 7 and /3 (region 
III), and, perhaps most interestingly, neither of the two 
(region IV) - with T c scaling, in the latter case, as \/~N. 
As for the more homogeneous 7 > 3 part, regions V and 
VI have a diverging critical temperature despite the fact 
that the second moment of p(k) is finite, simply as a 
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FIG. 5: Phase diagrams for scale-free networks with 7 = 2.5, 
3, and 3.5. Lines show the critical temperature T c marking the 
second-order transition from a memory (ferromagnetic) phase 
to a memoryless (paramagnetic) one, against the assortativity 
13, as given by Eq. (J?}. Other parameters as in Fig. [1] 



IV. DISCUSSION 

We have shown that assortative networks of simple 
model neurons are able to exhibit associative memory 
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FIG. 6: Parameter space /3 — 7 partitioned into the regions 
in which &(/3, 7) has the same functional form - where b is 
the scaling exponent of the critical temperature: T c ~ TV 6 . 
Exponents obtained by taking the large N limit in Eq. ([7]). 




FIG. 7: Examples of how T c scales with N for networks 
belonging to regions I, II, III and IV of Fig. [6] (/3 = —0.8, 
—0.35, 0.0 and 0.9, respectively). Symbols from MC simu- 
lations, with errorbars representing standard deviations, and 
slopes from Eq. ifF}. All parameters - except for j3 and TV - 
are as in Fig. [T] 



in the presence of levels of noise such that uncorrelated 
(or disassortative) networks cannot. This may appear 
to be in contradiction with a recent result obtained us- 
ing spectral graph analysis - that synchronizability of a 
set of coupled oscillators is highest for disassortative net- 
works [33| . A synchronous state of model oscillators and 
a memory phase of model neurons are both sets of many 
simple dynamical elements coupled via a network in such 
a way that a macroscopically coherent situation is main- 



tained 29] . Obviously both systems require the effective 
transmission of infomation among the elements. So why 
are opposite results as regards the influence of topology 
reported for each system? The answer is simple: whereas 
the definition of a synchronous state is that every sin- 
gle element oscillate at the same frequency, it is precisely 
when most elements are actually behaving randomly that 
the advantages to assortativity we report become appar- 
ent. In fact, it can be seen in Fig. [2] that at low tempera- 
tures disassortative networks perform the best, although 
the effect is small. This is reminiscent of percolation: at 
high densities of edges the giant component is larger in 
disassortative networks, but in assortative ones a non- 
vanishing fraction of nodes remain interconnected even 
at densities below the usual percolation threshold [3lj ]. 
Because in the case of targeted attacks it is this thresh- 
old which is taken as a measure of resilience, we say that 
assortative networks perform the best. In general, the 
optimal network for good conditions (i.e., complete syn- 
chronization, high density of edges, low levels of noise) is 
not necessarily the one which performs the best in bad 
conditions (partial synchronization, low density of edges, 
high levels of noise). It seems that optimality - whether 
in resilience or robustness - should thus be defined for 
particular conditions. 

We have used the technique suggested in Ref. [32| 
to study the effect of correlations on networks of model 
neurons, but many other systems of dynamical elements 
should be susceptible to a similar treatment. In fact, 
Ising spins (46[, Voter Model agents [49j], or Boolean 
nodes 50], for instance, are similar enough to binary 
neurons that we should expect similar results for these 
models. If a moral can be drawn, it is that persistence 
of partial synchrony, or coherence of a subset of highly 
connected dynamical elements, can sometimes be as 
relevant (or more so) as the possibility of every element 
behaving in the same way. In the case of real brain 
cells, experiments su gge st that hub neurons play key 
functional roles [HI, |40| . From this point of view, 
there may be a selective pressure for brain networks to 
become assortative - although, admittedly, this organ 
engages in such complex behavior that there must be 
many more functional constraints on its structure than 
just a high robustness to noise. Nevertheless, it would 
be interesting to investigate this aspect of biological 
systems experimentally. For this, it should be borne 
in mind that heterogeneous networks have a natural 
tendency to become disassortative, so it is against the 
expected value of correlations discussed in Ref. [32J that 
empirical data should be contrasted in order to look for 
meaningful deviations towards assortativity. 
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